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PREFACE 

The Agriculture and Resources Inventory Surveys Through Aerospace Remote 
Sensing is a multiyear program of research, development, evaluation, and appli- 
cation of aerospace remote sensing for agricultural resources, which began in 
fiscal year 1980. This program is a cooperative effort of the U.S, Department 
of Agriculture, the National Aeronautics and Space Administration, the National 
Oceanic and Atmospheric Administration (U.S. Department of Commerce), the 
Agency for International Development (U.S. Department of State), and the 
U.S. Department of the Interior. 

The work which is the subject of this document was performed withio the Earth 
Resources Research Division, Space and Life Sciences Directorate, at the Lyndon 
B. Johnson Space Center, National Aeronaut ‘-.s and Space Administration. Under 
Contract NAS 9-15800, personnel of Lockheed Engineering and Management Services 
Company, Inc., performed the tasks which contributed to the completion of this 
research. 
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1. INTRODUCTION 


For a previous study (ref. 1), classification maps of corn or noncorn for 26 
Large Area Crop Inventory Experiment (LACIE) corn and soybean sites were 
produced by using the corp profile classification method developed by Dr. G. 
Badhwar of the National Aeronautics and Space Administration (NASA) in Houston, 
Texas (refs. 2 and 3). This classification method incorporates the effects of 
crop- emergence date distribution and bases classification on the temporal 
spectral profile of the crop of Interest. Spectral data Is generated by the 
Landsat multlspectral scanner (MSS)i the crop profile is established for each 
site by local training of one Input training field. Generation of results for 
so large a data set confirmed (1) that this classification method can be 
applied successfully to varying acquisition data distributions and training 
field signatures, and (2) that a method for producing an acceptable 
classification can be defined. Also, the accuracy of the classification 
results, when compared with the field-observation-based Accuracy Assessment 
(AA) digitized ground-truth maps, Indicated that a more detailed analysis of 
results and evaluation of the Badhwar method should be done as an extension of 
the original work. 

In this report, the results of an expanded study into the classification of 
corn using the Badhwar technique is documented. The purpose of this expanded 
study Is (1) to define the probable incidence and Impact of the potential 
problem areas indicated by the first study, and (2) to generate detailed 
Information for use in further evaluation of this classification technique. 

The data set, section 2, is the site data set used in the previous study with 
nine additional corn and soybean sites for which the Accuracy Assessment 
digitized ground truth became available. The software used for this study 
differed from that used for the first study. Specifically, the image data was 
not edited by the use of SCREEN (ref. 4) before classification, although SCREEN 
was applied to the training field data. In addition, a modification was made 
in the formula for calculation of the chi-square threshold values used in 
classification. Software programs are presented in section 3. Section 4 
outlines the procedure used to fulfill the purposes of this study. Potential 
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problem areas are discussed In section 5 with an estimate of probable Incidence 
and Impact on classification results. Selected results of the classifications 
generated for this study are summarized in section 6, Recommendations for 
further appll.atlan of this classification method are given In section 7. A 
brief evaluation of the Badhwar classification method applied to corn Is given 
In section 8. 
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2. DATA SET 


Thirty-four corn and soybean segments were classified using acquisitions for 
which the acquisition distribution over the corn growing season was adequate 
for crop profile definition in each MSS channel. The full segment (22 932 
pixels) was classified. 

2.1 SITE SELECTION 

Geographical distribution of the segments used for this study is illustrated in 
figure 1. Twenty-six segments in the United States Corn Belt were successfully 
processed for evaluation as an aid to quality assurance for the Accuracy 
Assessment digitized ground-truth inventory maps (ref. 5). These segments were 
augmented by nine corn and soybean segments for which the digitized 
ground-truth maps recently became available. One segment In the data set 
(segment 123) could not be processed using the current software. The basic 
site data set used for this study is the 34 segments listed in table 1. This 
table includes LACIE sample segment numbers, locations, AA ground-truth 
percentages of corn, soybeans, sunflower, sorghum, and a comments section which 
identifies scene components (other than corn, soybeans, sunflowers, and 
sorghum) which comprise more than 10 percent of the scene. Scene percentages 
are computed only over the area of the segment identified by ground truth. 

2.2 ACQUISITION SELECTION 

The Badhwar classification method reported in this document requires four or 
five acquisitions in the postemergence to preharvest growth stages of corn. 
Five-acquisition classifications were made for as many segments as possible so 
that results could be compared with classifications generated on a four- 
acquisition subset. This dictated some relaxation of the criteria established 
in the first study, although generally these criteria were followed: 

a. Acquisition distribution must be adequate to define a crop profile. 

b. Acquisitions on cloudy or hazy days should be avoided. However, the effect 
of clouds or haze on crop profiles may be slight, and final judgment of 
acquisition usability was made with reference to graphs of the proposed 
training fields over the available acquisitions. 


2-1 


TABLE 1.- BASIC SITE DATA SET 


Segment 

County, !t«t« 

Corn 

percentage 

Soybean 

percentaoe 

Sunflower 

percentage 

Sorghum 

percentage 

Comment* j 

127 

Montgomery, Ind. 

49.4 

30.6 

0,2 

4 

I 

133 

Whitley, Ind. 

31.1 

17,2 


fir 

lot tree* I 

13* 

Chickasaw, Unit 

38.3 

24.2 


m 

* 

141 

Midi ion, love 

24.1 

18.9 


0.1 

26t pasture 

144 

Wipelto, Iowa 

19.4 

20.1 

* 

0.04 

31 t trees ; 

202 

Atchison, Mo, 

23,5 

32.6 


1.3 

22S pasture I 

20 S 

Clark, Mo, 

17,2 

46.5 

* 

0,02 

IQS tree* j 

209 

(•entry, Mo, 


21,1 

m 

1.6 

SOS pasture and trees 

211 

Grundy, Mo, 

mm 

22.4 


4.9 

5lt hay, tree*, «nd 
pasture 

216 

Mercer, Mo. 

6.6 

19.2 

4P 

1.3 

59% hay, trees, ind 
pasture 

241 

Oeuel, S. Oik, 

25.6 

5.8 

1,2 

0.7 

2 is pasture 

1 

000 

Clinton, I owe 

53.8 

27,3 

M 

0,02 

* ! 

$04 

Marshall, low 

46,0 

28.6 

* 

P 

- 

009 

Ogle, 111. 

53.2 

12.7 

4 

* 

j* 

824 

Iroquoit, 111, 

49.8 

«. o 

4 

0,2 

- 

832 

Ademi, Ind. 

21.6 

39.0 


0,1 

i 

037 

Benton, Ind, 

43,1 

36,0 


* 

• 

042 

Henry, Ind. 

42.6 

28.5 


m 

• 

843 

Henry, Ind. 

32.3 

31.2 

* 


US pasture 

852 

Randolph, Ind. 

27.0 

30.7 


- 

22* pasture and trees 

853 

Randolph, Ind. 

34.9 

30.3 


#4' 

* 

054 

Tippecanoe, Ind. 

49.2 

41.2 


Oe 2 

* 

660 

Uelli, Ind. 

2JU 

31.3 


• 

I3t nonagri- 
cul tural 

864 

Crawford, Iowa 

45.2 

11.7 


* 

17* pasture | 

865 

Crawford, Iowa 

33.1 

14.2 


1.3 

22t pasture j 

877 

Ida, Iowa 

38.3 

19.7 


- 

1H pasture (2QS ground c 
trutn not identified! 

878 

Kossuth, lowe 

43.0 

42.5 


* 

1 

| 

880 

Monona, Iowa 

44.6 

37.8 


<0.7 


881 

Monona, Iowa 

43.5 

7.9 


0,1 

22: pasture ‘ 

882 

Palo Alto, Iowa 

42.9 

38.9 


0.1 

i 

883 

Palo Alto, Iowa 

29.6 

32.0 

- 

0.04 

lit pasture ( 12* ground 
truth not identified) 

886 

Pottawatomie, Iowa 

46,8 

25.5 

- 

0.2 

■ 

891 

Shelby, Iowa 

46.4 

16.8 

- 

0,03 

13' oats 

892 

Shelby, Iowa 

,‘0.1 

14.3 


0.2 

* 
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Figure 1.- Geographical distribution of segments 




















































































c. Acquisitions should occur In the growing season of the crop, although 
considerable variation seems to be acceptable In the definitions of 
postemergence and preharvest. 

2.3 TRAINING FIELD SELECTION 

Ground-truth cornfields were Input to the program as training fields, for 17 
of the segments, training fields were selected by Gary Gutcheskl of NASA; 
training fields for the remaining sites were selected by the author. If 

available, training flairs were used as defined for the first study. 

Candidate training fields were selected on the production film converter (PFC) 
Imagery products using time criteria: 

a. Minimum training field size Is 20 pixels. 

b. Border and edge pixels are excluded from field delineation. 

c. Fields are to be free of roads, drainage patterns, etc.. If possible. 

4. Field should be free of clouds and haze on the acquisitions used. 

Since this study was directed toward problem definition, some of the selection 
criteria defined for the first study were relaxed; fields with unusual signa- 
tures were used as the basis for classification as well as fields which might 
be preemergent or harvested on the selected acquisitions. As expected, crop 
profile definition based on the training fields differed over a given set of 
acquisitions. The effect of this on classification results was assessed In 
connection with the discussion In section 5. Each segment had a maximum of 
five training fields defined; overall, each of 111 training fields was used as 
a base for classification. 



3. SOFTWARE 


All data processing used to generate the classifications was done on the 
programmed data processer model 11/45 (POP-11/45). Several software programs 
were used, and these programs are described In this section. The software 
programs described In this section have also been used successfully for the 
classification of spring wheat (ref. 6). 

3.1 IMUNLD2A 

IMUNLD2A takes an Image unload tape generated on the Earth Resources Inter- 
active Processing System (ERIPS), edits It using SCREEN (ref. 4), adjusts the 
Landsat-3 acquisitions Into a data range comparable to the data range of 
Landsat-2 acquisitions using the Wehmanen multiplicative factors (ref. 7), and 
loads the Images Into a POP-1 1/45 disk. 

• Input: ERIPS Image unload tape. 

• Output: screened, Landsat-3 adjusted Images on a POP-11/45 disk. 

3.2 IMAPLT 

IMAPLT (ref. 8) plots the Individual pixels of a field, giving reflectance 
values versus time (l.e., the acquisition dates specified) for each channel. 
IMAPLT then plots the field mean values In each channel with one standard 
deviation envelope; a curve Is fitted through the mean values. Eight graphs, 
two for each MSS channel, are produced for a field over a set of acquisitions. 
Graphs are displayed on the Image-100 Tektronix screen, and hardcopies are made 
automatically. Listed on the first plot are the segment number, the acquisi- 
tions used, the coordinates of the field, the channel number, number of pixels 
In the field, and the mean and standard deviation on each acquisition. Listed 
on the second plot are the constant values computed from the data for the model 
(with the estimated error), the estimated field planting date (with error), the 
values of the fitted curves at the specified acquisitions (which can be 
compared with the computed mean values of the data), and the chi-square value 
for the fit of the approximating curve to the field data. 
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• Input: field coordinates (line, pixel) In order; acquisition set of four or 
five acquisitions. 

• Output: eight graphs as above. 

3.3 CLASFYT 

CLASFYT (ref. 9) computes the constants for the curves from the training field 
data, compares (with this crop profile In each channel) the values for each 
pixel In the segment 1 , and rejects those pixels which are not within a 
specified chi-square measure of the profile. The technique for rejection Is to 
compare pixel channel values with the profiles In channel 2, channel 3, channel 
4, and channel 1 In succession, and to reject If the comparison In any single 
channel Is Inadequate. Variability of the time of planting/emergence Is 
allowed for In the comparison of Individual pixels with the corp profile (refs, 
2 and 3). Accepted pixels are labeled corn, and rejected pixels are labeled 
noncorn. 

• Input: five- or four-image files; coordinates of one crop-of- Interest field 
to establish crop profiles; Initial values for the function constants as 
computed In IMAPLT (to aid convergence of the approximating curve). 

• Output: classification file on disk which has a designation of corn or 
noncorn for each pixel In the segment; llneprlnter sheet summarizing the 
following: 

a. The acquisitions used, 

b. The training field coordinates and the number of pixels In the field. 

c. The mean and standard deviation for each channel and each acquisition 
(field averages). 


1 — 

As each Image was unloaded from an ER IPS Image unload tape onto a disk for 
processing on the PDP 11/45, It was edited using the ERIM program SCREEN, a 
procedure for automatically detecting garbled data, clouds, snow, cloud shad- 
ows, and water In MSS data. Pixels In the training field that failed to pass 
this edit step were excluded from processing and did not affect the crop pro- 
files. However, screened pixels were restored before classification of the 
segment, so all 22 932 pixels are designated as corn or noncorn. 



d. The Input and the final constants (with error) for the model. 

e. The, final chi-square values for each channel (training field data), 

f. The estimated planting date of the training field (with error) as 
derived for each channel. 

g. The chi-square thresholds In each channel applied as cutoff values In 
classification. 

h. The number of pixels cut and removed from consideration as corn for 
exceeding the chi-square threshold In each channel. 

1. The final numerical results: the number of pixels classified as corn, 
the number of pixels screened (always "0" In this study), and the number 
of pixels rejected as corn. 

3.4 MISMAP 

MISMAP (ref. 9) compares the classification file produced by CLASFYT with the 
AA digitized ground- truth Inventory map for the segment. A numerical scene 
summary Is given In confusion matrix form. A llneprlnter map Is generated with 
this code: 

a. Ground- truth corn classified as corn appears as C. 

b. Ground- truth noncorn rejected as corn Is left blank. 

c. Ground-truth noncorn classified as corn appears as +. 

d. Ground- truth corn rejected as corn appears as -. 

e. Pixels for which ground truth Is not available but which are classified as 
corn appear as $. 

f. Pixels for which ground truth Is not available but which are rejected as 
corn appear as %. 

MISMAP maps can be generated for all pixels or for pure (AA definition) pixels 
only. Pure pixels (AA) are those which on a subpixel level contain only one 
crop. 


• Input: classification file from CLASFYT and ground-truth inventory map 
file. 

• Output: full-scene lineprinter map comparing the classification map with 
the ground-truth map and a confusion matrix numerical summary of results. 

3.5 MISMAP1 

MISMAP1 was developed for use in this expanded study of the application of 
Badhwar's classification of corn. This program can be used to compare two 
classification files created by CLASFYT. A numerical comparison is given in 
confusion matrix form. A lineprinter map is generated with this code: 

• Pixels classified as corn on both files appear as C. 

• Pixels rejected as corn on both files are blank. 

• Pixels classified as corn on one file but rejected as corn on the other 
appear either as + or -, depending upon the order of ^ntry of the file 
names. 

MISMAP1 was used to compare the classification files produced by two different 
training fields by using the same set of acquisitions, and it was also used to 
compare a five-acquisition classification with the four-acquisition 
classification that was generated by the same training field. 

• Input: two classification files produced from CLASFYT. 

0 Output: full-scene lineprinter map comparing the classification maps and a 
confusion matrix numerical summary of results. 

3.6 TAPEOUT 

TAPEOUT (ref. 10) reads the data files produced by CLASFYT and creates 
universal format tapes. Black and white film product classification maps are 
produced on the PFC from these tapes. The scale used is the same as that of 
the PFC color imagery. The classification maps produced for this study are 
available for reference,. 

• Input: classification file from CLASFYT 

0 Output: black and white full-scene classification map on film. 
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4. PROCEDURE 


Detailed description of the modeling used In Badhwar classification Is given in 
"A Semi-Automatic Technique for Mult itemporal Classification of a Given Crop" 
by G. Badhwar (ref. 2) and "Crop Emergence Data Determination from Spectral 
Data" by G. Badhwar (ref. 3). Implementation methods are explained in 
"Implementation of Badhwar Classification of Corn/Soybean Segments" by 
W. Austin (ref. 1). The software programs utilized have been presented in 
section 3. The procedure which was used to integrate these factors is outlined 
below. 

a. LACIE corn/soybean segment images were unloaded to a PDP-11/45 disc using 
IMUNLD2A. 

b. Acquisition sets were chosen and candidate training fields of corn (two to 
five per segment) were selected by an analyst using the PFC imagery 
products. 

c. Candidate training fields were graphed over five 2 and over four acquisitions 
using IMAPLT. 

d. Referring to the IMAPLT graphs, the analyst confirmed that (1) the acquisi- 
tion set was adequate for crop profile definition, and (2) candidate train- 
ing fields were larger than 20 pixels and were of reasonable homogeneity. 

For this study, no judgment of acquisition coverage relative to the training 
field growth cycle was made, although this was noted. A consistent set of 
acquisitions was used to classify all training fields so that the effect of 
using "early" or "late" (relative to the available acquisitions) fields for 
classification could be examined. Also, no restriction was placed on the 
chi-square measure of the fit of the approximating curve to the training 
data because the usefulness of this chi-square measure as a predictor of 
classification results was to be observed during this study. 

e. Each candidate training field larger than 20 pixels and free of haze and 
clouds on the acquisition set was used as a base for a classification on 


2 

Sample segments 800, 832, and 878 did not have five acceptable acquisitions 
available. 
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five acquisitions and on a four-acquisition subset of this. Acquisition 
sets usually conformed to those reported In reference 1. The restoration of 
pixels removed by SCREEN to the Images before classification Increased the 
available acquisition choice (some acquisitions were avoided In the first 
study because of excessive screening out of the pixels), so some differences 
do exist. In addition, some experimentation was done using early and 
postharvest dates. Training field coordinates and the selected acqulsltons 
were put In a data file and processing was done "batch mode." Two hundred 
and twenty-five classifications were processed. 

f. Each classification was compared with the digitized ground truth, using the 
software program MISMAP. 

g. Classification tables were compiled for each segment. These tables provided 
the detailed Information required for evaluation of this classification 
technique. For each training field and each acquisition set, this 
Information was recorded: 

1. The number of pixels classified as corn. 

2. The ground-truth corn classified as corn (segment percentage, AA pure 
pixels only and all pixels). 

3. The ratio of ground-truth corn rejected as corn to the ground-truth 
noncorn classified as corn (In segment percentage, AA pure pixels only 
and all pixels). 

4. The final values of the constants (A, o, 3) used In the functions which 
define the crop profile In each channel. 

P v (t) - At“exp(-3t 2 ) 

where 

P v (t) Is the reflectance at time (t) 

5. The chi-square measure of curve fit to the training field data for each 
channel profile. 
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6. The chi-square cutoff threshold for each channel used fov rejection of a 
pixel as com with the number of pixels rejected by profile comparison 
In that channel. 

7. Comments relevant to the classifications. 

h. From the tables, the best available set of five and of four acquisitions and 
the training field were selected. These sets for the 34 segments are listed 
In table 2, section 5.1. A preferred acquisition set for each segment was 
selected. 

1. A spot check comparison of disagreement was done on the MISMAP for the 
preferred classification of each segment. The largest percentage of 
classification disagreement with the ground truth Is border and edge pixels; 
wherever an area was In disagreement, this was examined. This analysis was 
done to evaluate the performance of the program and to aid In problem 
definition. 

j. The software program MISMAP1 was used to compare five- and four-acquisition 
classifications based on the same training field, and results were 
Incorporated Into section 5.1. MISMAP1 was also used to compare the 
preferred classification to the classifications based on other training 
fields over the same acquisitions as part of the training field sensitivity 
problem definition (section 5.3). 
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5. DISCUSSION OF POTENTIAL PROBLEM AREAS 


Several potential problem areas were Identified In the study documented In 

reference 1. Two problems were corrected before processing was done for this 

study. 

1. Routine use of SCREEN caused excessive editing of the Images and reduction 
of the number of pixels classified. For this study, pixels removed by 
SCREEN were restored before classification; hence, a classification was made 
for 22 932 pixels In every segment. 

2. The chi-square threshold values In the Individual channels are dependent 
upon the training field data. In the first study* If the training field 
data were approximated very well by the curve, an underclasslflcatlon was 
likely to result. For this study, minimum threshold values were raised 
and the scaling factor In the formula used to calculate the chi-square 
cutoff was changed. 

Problem areas discussed below are these: 

1. The effectiveness of four-acquisition classification compared to 
classification on five acquisitions. 

2. Inadequate acquisition coverage. 

3. An apparent sensitivity of classification results to the choice of training 
field. 

4. The convergence of the approximating curve to the training data. 

5. The effectiveness of the growth cycle adjustment and the adequacy of the 
range specified for variation In planting data. 

6. Identification of reasons for ground-truth corn falling to be classified as 
corn. 


5.1 THE EFFECTIVENESS OF FOUR-ACQUISITION CLASSIFICATION COMPARED TO 

nvr-ic'quisTnoN classification 

5.1.1 METHOD USED TO ASSESS PROBLEM 

Classifications based on 102 training fields using five and four acquisitions 
were generated for this comparison. Selecting comparisons which are signifi- 
cant requires caution. If, to have five acquisitions, an acquisition of poor 
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>Mta quality or a harvest date must be Included, then classification on four 
acquisitions will be more accurate. This situation was checked using the 
software program MISMAP1; differences In classifications on five-acquisition 
and four-acquisition subsets with the same training field usually corresponded 
quite well to clouds or to harvested fields. If, however, five acquisitions of 
good data quality are available, the potential exists for better crop 
separability. Indeed, If a date of significance in crop separability Is 
excluded, crop separability Is lost for the comparable four-acquisition 
classification. 

In the first study, five-acquisition classification was preferred; four- 
acquisition classification was done only If five acquisitions were not 
available or If no training field met the stated criteria using five acquisi- 
tions. Seventeen segments were classified on five acquisitions, and eight 
segments on four acquisitions. For this study, the number of acquisitions 
selected for the same data set was an analyst choice; six segments were 
selected on five-acquisition classification, and 19 were selected on four- 
acquisition classification. Accuracy was Improved. 

Table 2 provides a comparison of five-acquisition and four-acquisition 
classification. Eight of the segments listed (135, 144, 202, 800, 832, 878, 
891, 892) are not suitable for use as a base In program development, but they 
are Included so comparison could be made on segments with severe processing 
deficiencies. Percentages were calculated by the MISMAP program. This 
Information Is listed for each segment: 

• Segment number and location. 

• Training field coordinates (line, pixel) and training field sizes (pixels). 

• A best-available five-acquisition set and a best-available four-acquisition 
subset of this; the preferred acquisition set is starred. 

• Comparison of the classified proportion estimate to the ground-truth propor- 
tion estimate In segment percentage, AA pure pixels only. The proportion 
estimate does not Include the areas which were not ground-truth Identified. 
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TABLE 2.- A COMPARISON OF FIVE- AND FOUR-ACQUISITION CLASSIFICATION 
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• Ratio of the ground-truth corn classified noncorn to the ground-truth non- 
corn classified corn. AA pure pixels only, segment percentages. 

• The total number of pixels classified as corn. 

• The final value of the constants (A, a, fl) in each channel used to model the 
crop profile. This provides a comparison of the curve approximations of the 

data. 

t Comments, including the percentage of the segment not Identified ground 
truth, and the number of AA pure pixels upon which calculation of the scene 
percentages is based, 

5.1.2 SUMMARY 

If the crop profile, adequately representative of the data and sufficiently 
well defined to provide crop separabllllty, can be generated on four 
acquisitions, classification results seem to be equally good using a set of 
four acquisitions or five acquisitions. The distribution of acquisitions 
relative to the crop growth stage of the training field and the data quality of 
the acquisitions Is more Important than the number of acquisitions used. 

Obtaining Landsat acquisitions of good data quality Is a problem. A 
requirement of five acquisitions for application of Badhwar classification 
would restrict the number of segments that could be classified. Comparable 
results using fewer acquisitions will permit the classification of more 
segments. 

5.2 INADEQUATE ACQUISITION COVERAGE 
5.2.1 METHOD USED TO ASSESS PROBLEM 

Inadequate acquisition coverage was assessed by effect. Thirty-eight corn and 
soybean segments were examined for application of Badhwar classification; two 
were rejected for Inadequate acquisition coverage, one for an allied problem 
(inadequate acquisitions of usable data quality), and one more for processing 
problems related to data quality. Of the remaining 34 segments used in this 
study, eight are not recommended for continued use in program development 
because of Inadequate acquisitions of acceptable data quality and three 
sepents did not have five acceptable acquisitions available for processing. 
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Adequate acquisition coverage of good data quality is a continuing and severe 
problem In the use of Landsat data. Maximum coverage is every 9 days (using 
data from both satellites) and much of this is so adversely affected by 
atmospheric conditions that It cannot be used In classification. 

Badhwar classification can currently be applied to segments where there are a 
minimum of four acquisitions, Including one in the green-up and one in the 
senescent growth stage of the crop; this is comparable to the requirements of 
other classification methods. Badhwar classification tends to be successful 
even with marginal or poor acquisition coverage. For the eight segments listed 
in section 5.1 as rejected for use in future research, misclassification 
averages 21.3 percent for an approximate accuracy of 78,7 percent. 'From 
section 5.1, It can be Inferred that Improved results weie generated for 11 
segments where acquisition requirements were relaxed from five to four. 
Similarly, if requirements could be reduced even further to the absolute 
minimum for curve definition— three acqulsitons— Improved results would be 
expected for 10 additional segments. 

5.2.2 SUMMARY 

Inadequate acquisition coverage of good data quality exists as an important 
limitation for any classification method which uses Landsat data. The Badhwar 
method is not affected more severely than other methods; accuracy appears to 
remain satisfactory even when using very poor acquisition sets. However, if 
the program could be modified to relax restrictions on the number of acquisi- 
tions (and their positions relative to the crop growth stage), classification 
results could be expected to improve since more stringent requirements on data 
quality could be maintained. 

5.3 APPARENT SENSITIVITY OF CLASSIFICATION RESULTS TO THE CHOICE OF TRAINING 

mr 

5.3.1 DEFINITION 

Training field sensitivity refers to a situation where classifications based on 
different training fields differ by more than 2000 pixels classified as corn, 
although the training field signatures are similar, and similar classification 
results would be expected. 
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5.3.2 METHOD USED TO ASSESS PROBLEM 


Nine sites were selected as suitable for investigation of this potential 
problem. These sites had good acquisition coverage and included sites for 
which alternate training field selection had been used effectively as a rework 
tool during the first study. Segment 809 was included as an example of 
training field insensitivity. The 10 sites with acquisition sets are listed on 
table 3. 

The procedure used was described in section 4, with some additional steps. 

• General data quality was noted for each of the acquisitions used. Data 
quality could cause classification differences and would preclude 
attributing these differences to training field sensitivity. 

• Training field signatures on the sequence of PFC imagery were compared in 
detail. Graphs of the training fields were compared in each channel, and 
differences (data dispersion, convergence of the curve to the data, etc*) 
were noted. These differences would be expected to produce differences in 
classification results. 

t Training fields were used as test fields if they were not used to define the 
crop profile and the classified results of these fields were noted. This 
proved to be an effective way to assess overall classification results. 

• MISMAP1 was used to compare the classifications based on different training 
fields with classifications based on the training fields listed on table 2. 
This comparison map of the scene plus the confusion matrix numerical summary 
provided both an area and a statistical comparison of differences. 

Of the 10 segments, the problem could be defined on the segments listed in 
table 4 for the stated acquisition sets and as affecting the listed training 
fields. The ratio of the number of training field combinations exhibiting 
sensitivity to the number of combinations examined for the segment is also 
given. 

Thus, of the 53 combinations examined on these segments, the defined problem 
could be isolated only 17 times. This does not mean that training field 
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TABLE 3.- SITES AND ACQUISITION SETS 


Segment 

County, state 

Five-acquisition set 

Four-acquisition set 

" 127 

Montgomery, Ind. 

161, 

207, 

216, 

243, 

252 

161, 

216, 

243, 

252 

141 

Madison, Iowa 

167, 

212, 

220, 

265, 

274 

167, 

212, 

220, 

265 

809 

Ogle, 111. 

164, 

218, 

244, 

262, 

271 

164, 

218, 

244, 

271 

860 

Wells, Ind. 

160, 

197, 

232; 

251, 

268 

160, 

197, 

232, 

251 

864 

Crawford, Iowa 

159, 

186, 

222, 

231, 

267 

159, 

186, 

231, 

267 

865 

Crawford, Iowa 

168, 

186, 

231, 

249, 

267 

168, 

186, 

231, 

267 

877 

Ida, Iowa 

150, 

186, 

222, 

231, 

267 

186, 

222, 

231, 

267 

880 

Monona, Iowa 

150, 

186, 

222, 

231, 

267 

186, 

222, 

231, 

267 

881 

Monona, Iowa 

159, 

186, 

222, 

231, 

267 

186, 

222, 

231, 

267 

882 

Palo Alto, Iowa 

159, 

186, 

222, 

231, 

267 

159, 

222, 

231, 

267 


TABLE 4.- TRAINING FIELD SENSITIVITY 













sensitivity does not exist, only that it Is very difficult to isolate. An 
examination of 281 combinations over the 34 segments located 18 potential 
incidences of training field sensitivity, only one more than defined on the 10 
segments. 

For this study, training fields which were obviously affected by haze or cloud 
cover on any one of the acquisitions were not used as a base for classification. 
However, classifications were performed on training fields for which the acqui- 
sition distribution relative to the field crop development was poor (or other 
problems existed) and on training fields of varying signatures on the imagery 
and as graphed. Acquisition sets which included hazy or cloudy acquisitions 
were used for classification. However, these factors were considered legitimate 
bases for classification result differences, and training field sensitivity 
could not be defined as the source of differences in results when these factors 
were present. In summary, nine of the 34 sites were selected because they 
indicated a sensitivity of classification results to training field choice 
existed; sensitivity of results to training field choice is defined only if 
results are expected to be similar based on examination of training field 
signature. 

5.3.3 SUMMARY 

Training field sensitivity cannot be isolated and defined using a multipurpose 
data set such as this one. For this data set, there is too much variation in 
training field signature. Training fields which have similar signatures should 
be defined for a set of sites with good acquisition coverage. These fields 
should also conform to some criteria for the fit of the approximating curve to 
the training data. Then, if training field choice affects classification 
results, this will be significant. Sample segment 881 is an example of a 
segment suitable for use in such a study. 

The incidence of this problem is indeterminate, but probably acceptable since 
the impact is minor. An unacceptable level of underclassification or overclas- 
sification is clearly indicated by a loss of field pattern and an increase in 
scattering on the classification map. Test fields could be defined as an aid to 
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classification evaluation; a film product classification map would facilitate 
comparison with the PFC imagery. 

Alternate training field selection as a rework technique is effective, clear, 
and simple. Change of training data to Improve classification results has been 
used with success during all of the LACIE project. 

5.4 CONVERGENCE OF THE APPROXIMATING CURVE TO THE TRAINING FIELD DATA 
5.4.1 METHOD USED TO ASSESS PROBLEM 

All training fields for this study were graphed at least twice using IMAPLT. 
Examination of these graphs and comparison with the chi-square measure of the 
fit of the approximating curve to the Landsat data was used to evaluate this 
problem. Initial definition of this problem, "convergence of the curve to the 
training field data appears incomplete," resolved to "the curve fit to the data 
was marginal because the data Is difficult to fit." This redefined problem 
occurred with more frequency than anticipated; many of the segments exhibited 
data patterns which could not be closely approximated by the modeling function. 

However, in these segments curve fit may be adequate for crop profile definition 

and crop separation, and it may produce a good classification even when the 
chi-square measure of the curve fit to the training data was large and the curve 

approximation aesthetically displeasing. The quality of the curve is important, 

however. Poor curves, when the data defined shallow curves or even lines, 
tended to result in overclassifications even if these were well fitted to the 
data. 


5.4.2 SUMMARY 

Much of the Landsat training field data cannot be closely fitted by a curve; 
atmospheric conditions, cropping practices, and the normal variation in data 
often produce lumpy crop profiles. The local training used in this method is 
sensitive to local conditions, which is an advantage since the lumps tend to 
have significance. The classification method seems to be able to tolerate 
considerable smoothing; i.e., poor fit of the approximating curve to the 
training data. The quality of the curve (the definiteness of the curvature) is 
important for crop separability. 
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In classification, the use of the chi-square measure is very effective. How- 
ever, observation of the chi-square measure of the fit of the curve to the 
training data, an intermediate step, does not seem significant for predicting 
classification results. A low chi-square measure can occur from (a) a good 
curve fit to the training data, (b) disperse training data, or (c) a line 
approximation of the data. A large chi-square measure can occur from (a) a 
smoothed and adequate approximation of the training data or (b) a poor data 
approximation. This chi-square measure must be used with the graphs of the data 
and the crop profile curve; alone, it is not significant. 

It was seldom that all the training fields that were defined for a segment 
exhibited poor fit of the approximating curve to the data, even in the 
experimental mode used for this study. This classification method performs very 
well. When a distinctive crop profile is defined even if training field data 
cannot be closely approximated by a curve, results tend to be good. 

5.5 THE EFFECTIVENESS OF THE GROWTH CYCLE ADJUSTMENT AND THE ADEQUACY OF THE 

rAnge specified f6r var i A t i On In planting date 

5.5.1 METHOD USED TO ASSESS PROBLEM 

Training fields not in use to define the crop profile were used as test fields 
in 10 segments. Classification of these fields provided some indication of the 
effectiveness of the growth cycle adjustment, because the fields tended to 
exhibit different growth stages relative to the acquisition set. For the 
preferred classification of each segment, areas of disagreement with the ground 
truth were examined In detail. This examination located corn which failed to be 
classified as corn and identified a possible reason for the omission, including 
an atypical growth cycle for the omission. 

The growth cycle adjustment performed very well. If a training field were 
selected for which the available acquisitions were well distributed for good 
approximation of the data by a curve, this crop profile curve produced a 
successful classification of fields for which the available acquisitions did 
not— because of differences in planting dates— define a good crop profile. 


5-13 


5.5.2 SUMMARY 


For 28 of the segments, no areas of corn which failed to be classified corn 
could be attributed to failure of the growth cycle adjustment or planting date 
range specified in the program. Six segments, two in Missouri and four ir 
Indiana, exhibited signature differences for corn which might have been 
classified more successfully if the allowed range for planting data had been 
extended. Problems with the growth stage adjustment were rare and seem to be 
limited to a geographic region. 

5.6 IDENTIFICATION OF REASONS FOR GROUND-TRUTH-IDENTIFIED CORN FAILING TO 

BE CLASSiTTHnSim 

5.6.1 METHOD USED TO ASSESS PROBLEM 

The software program MISMAP creates a lineprinter map comparison of a classi- 
fication with the digitized ground truth. The preferred classification for 
every segment was examined in detail. The amount of misclassification using 
this method is small, and the majority of the misclassification is border and 
edge pixels. The areas of ground-truth-identified corn which failed to be 
classified as corn were checked by sampling approximately 500 pixels per 
segment . 

5.6.2 SUMMARY 

Omission errors could usually be traced to unusual corn signatures on the film 
products or graphs or to an apparent inconsistency between the ground-truth 
label and the imagery. Sometimes the source of atypical signatures could be 
traced to agricultural or meteorological causes; for instance, fields were cut 
early for silage, damaged by hail, or contained pumpkins or soybeans which 
confused the late acquisition. The sensitivity of the channel crop profiles to 
episodic events was impressive. It was very easy to tell from the graphs that 
the crop had been affected; tracing the probable agricultural or meteorological 
cause was very difficult. Much more detailed correlation of agricultural events 
to reflectance changes in the Landsat channels needs to be done. 
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Mlsclasslflcatlon, excluding border and edge pixels, Is very small using this 
technique, and. In addition, each segment exhibited different reasons for 
mlsclasslfleatlon. No consistent pattern could be established. 
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6. RESULTS 

Classification results are summarized in table 5 in this section. These results 
are for the sample segments of table 2, the preferred (starred) acquisition set. 
Training fields are as listed on table 2. Percentages are computed based on all 
pixels in the segment, regardless of purity, and the base for all percentage 
calculations is 22 932 pixels. 

Table 5 gives the following: 

a. Segment number and location. 

b. Acquisitions available in the growing season of corn, taken as Julian dates 
130 to 300. Landsat-3 acquisitions are denoted by (3). The presence of 
clouds is indicated by (C), and haze is indicated by (h). Other factors 
affecting data quality are denoted (a) and explained. Acquisitions used to 
produce the classification results are underlined. 

c. A confusion matrix of the classification is given: (1) ground-truth corn 
classified as corn; (2) ground-truth corn classified noncorn; (3) ground 
truth noncorn classified corn; (4) ground-truth noncorn classified as 
noncorn. These percentages are as calculated by the MISMAP program. 

d. The percentage of the segment not ground-truth identified^ 

e. Comments on factors which might affect classification quality, problem areas 
encountered in processing the segment, and (for some segments) an assessment 
of the usefulness of the segment in program development. 

Classification agreement with the ground truth improves as the purity of the 
pixels Included in the results improves. Average disagreement for the preferred 
acquisition set all segments is 17 percent using A. A. pure pixels only; approxi- 
mate accuracy, then, is 83 percent. Average disagreement is 21 percent over all 
pixels; approximate accuracy is 79 percent. 
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TABLE 5.- CLASSIFICATION RESULTS 
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Matrices of average values are given below for AA pure pixels (A) and for all 
pixels (B). 

(A) / 24.8 
\ 7.7 

Pure pixels only, 34 segments All pixels, 34 segments 

/C— C C— N\ /c—c C— n\ 

\N-*-C N— N/ N— -N/ 



Omitting the eight segments with processing deficiencies listed on page 5-2 did 
not have much effect on average disagreement. For pure pixels only, disagree- 
ment averages 17 percent and approximate accuracy is 83 percent (matrix C). For 
all pixels, disagreement averages 19.8 percent for an approximate average 
accuracy of 80.2 percent (matrix D). 


(C) /24.4 


8.7\ (D) /23.2 


11.7' 


8.3 


53.9/ 


8.1 


52. Oi 


Pure pixels only, 26 segments All pixels, 26 segments 
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7. RECOMMENDATIONS 


• More research should be done into the selection process for training fields 
which are the basis of successful classification results. It is recommended 
that this research have a double purpose: 

1. Establishment of criteria for training field selection. If possible, 
criteria should be based on the use of the PFC film products {or inter- 
active console) alone; this would make it possible to omit the IMAPLT 
step used in the procedure. Graphs of the training field could be 
presented with the classification results, and IMAPLT could be reserved 
for use in rework. 

2. Establishment of a set of training fields for a segment which are similar 
in graphic and film product signature. A data set of several such seg- 
ments could be used to decide this question: Do comparable training 
fields generate comparable classification results, or are classification 
results sensitive to the choice of the training field? 

• This method of classification should be extended for multicrop use by 
sequential classification of pixels. Those pixels classified as corn should 
be edited from the image, then a crop profile of soybeans could be used to 
classify the remaining pixels as soybeans or as rejected. A data set suit- 
able for testing this application could be selected from the data set 
presented in this report. Sites selected should have good acquisition 
coverage in the growing season. 

• Inadequate acquisitions of good data quality are a continuing problem in use 
of Landsat data for any classification method. For application of Badhwar 
classification, it is recommended that two potential paths be assessed for 
improved use of the available acquisition coverage. 

1. A computer program using three well -distributed acquistions should be 
developed for establishing the crop profile. Results of four-acquisition 
classification were comparable to those based on five acquisitions; 
perhaps a further reduction to three acquisitions is feasible. 

2. More research should be done into the use of preemergence and postharvest 
acquisitions. In this study, such acquisitions were used when crop • 
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profile definition was poor (i.e., curves tended to be flat) without 
apparent penalty. Relaxation of the postemeergence to preharvest range 
for acquisitions would increase the number of acquisitions available for 
Badhwar classification. 

• A decision to increase the range allowed for the estimated planting date in 
Missouri and Indiana should be assessed by correlating ground-truth corn- 
fields exhibiting unusual signatures with the local cropping practices. 

• Similarly, research should be conducted into the correlation of unusual crop 
profiles in a channel to agricultural and meteorological events. If signifi 
cance can be defined for the observed episodic events, usefulness of the 
channel crop profiles will be increased. 

• The order In which the chi-square cuts in individual channels are applied 
should be examined for possible effect on classification. 
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8. EVALUATION 


The Badhwar classification method applied to corn Is very successful. For this 
report, unsuitable training data were processed In an attempt to exaggerate 
problem areas. The accuracy of the method, even under adverse conditions such 
as Inadequate acquisition coverage, frustrated this approach and confirmed the 
stability of the method. 

Recommendations for extending the application of the program arose from this 
study, but no recommendations can be suggested for Improving the performance of 
the program. This program should be considered operational for the classifica- 
tion of corn. 
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